Introduction



The Hard Core Model/Golden Mean Subshift/Independent Sets is a highly useful model
in various disciplines, as witnessed by its many appearances under these distinct
names in Statistical Mechanics, Symbolic Dynamics and Theoretical Computer Science
respectively. Perhaps its most intuitive description is that of identical hard
particles whose only interaction is repulsion on contact. It has been studied for
decades in various discrete setups with some notable breakthroughs like Baxter's
solution on the triangular lattice ([B1]), yet its general treatment still seems
elusive.

The model can be considered in different regimes. In the high density/low
temperature case the characterization of the allowed configurations is essentially a
packing problem ([E]). Here we concentrate on the loose packing/high
temperature/entropic regime where configurations don't have such long range order
and hence no striking geometric structure. The key unanswered question is the
exponential size of the configuration space, which in turn boils down to the
computation of the topological entropy. For almost all lattice set-ups the exact
answer is not known. Here we try to alleviate the situation a little by establishing
a procedure that in principle yields arbitrarily good estimates of the entropy on
the lattices and moreover gives some insight into the structure of the measure of
maximal entropy.



1. Set-up 

Let L be a planar lattice. Subsequently we will consider mostly one of the regular
lattices (square ($\mathbb{Z}^2$), honeycomb (H) or triangular (T)). In a few cases
we illustrate the principles developed on more exotic stages like the square lattice
with the Moore neighborhood ($\mathbb{Z}^2$M, nearest neighbors are within hop count
one and two) or the Kagome lattice (K).

A configuration on L satisfying the Hard Core Rule is an element in $X = \{0,1\}^{L}$
where no two 1's can be nearest neighbors. This rule can naturally be viewed as a
zero range infinite repulsive potential, i.e. a hard exclusion rule not unlike that
in hard sphere packing. Call the collection of configurations X^f.

The exclusion rule naturally imposes a sublattice split on L if it is a $k$-partite
graph. For example on $\mathbb{Z}^2$, a bipartite graph, one can fill all sites on
$\sqrt{2}\,\mathbb{Z}^2$ ($\mathbb{Z}^2$ rescaled by $\sqrt{2}$ and rotated by 45°)
with 1's and the rest of $\mathbb{Z}^2$ must then be all 0's. Call the former the
even sublattice, $L_e$, and the latter the odd sublattice, $L_o$ (it is a
(1/2, 1/2)-shifted copy of the former). In rendering these we will present the
even/odd sublattices as circle/dot sublattices. In a similar fashion H splits into
two identical sublattices and T (a tripartite graph) into three "thinned" copies of
T. Both in the dense packing regime of [E] and in the loose packing, entropic regime
of this paper, this splitting will be highly relevant.



Let $\mathcal{X} \subset X$. The standard measure of variation in the assignments on
the lattice sites is as follows (as usual $|\cdot|$ means cardinality and $x|_A$
means the restriction of $x$ onto the set $A$):

Definition: The topological entropy of the set $\mathcal{X}$ is

$$h^{\mathcal{X}}_{top} = \lim_{n\to\infty} \frac{1}{n} \ln \left| \{ x|_{A_n} : x \in \mathcal{X} \} \right|$$

where $|A_n| = n$ and the sequence $\{A_n\}$ grows in a sufficiently regular fashion.

Remark: For instance for the full shift on any lattice $h_{top} = \ln 2$ (indicating
two independent choices per lattice site). If $L = \mathbb{Z}$ the hard core model is
explicitly solvable and a standard transfer matrix argument implies that
$h_{top} = \ln\frac{1+\sqrt{5}}{2}$ (see e.g. [W]). For two and higher dimensional
lattices the matrix argument breaks down and the exact value of the hard core
topological entropy remains an unsolved problem. In this paper we try to approach
and in particular approximate it in a novel way.
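The one-dimensional value quoted in the Remark can be checked numerically. The sketch below is only an illustration, not part of the original argument: it counts the allowed words of length n with a two-state recursion (equivalent to powers of the 2×2 transfer matrix) and compares (1/n) ln(count) with the logarithm of the golden mean. The function name `count_hard_core_words` and the choice n = 200 are assumptions made for this example.

```python
import math

def count_hard_core_words(n: int) -> int:
    """Count 0/1 words of length n >= 1 with no two adjacent 1's."""
    end0, end1 = 1, 1  # words of length 1 ending in 0 / ending in 1
    for _ in range(n - 1):
        # a 0 may follow any symbol, a 1 may only follow a 0
        end0, end1 = end0 + end1, end0
    return end0 + end1

# exact value from the transfer matrix argument
golden_mean_entropy = math.log((1 + math.sqrt(5)) / 2)

# finite-size estimate (1/n) ln |{allowed words of length n}|
n = 200
estimate = math.log(count_hard_core_words(n)) / n

print(golden_mean_entropy, estimate)
```

The finite-size estimate approaches the exact value from above at rate O(1/n), which is why a fairly large n is used.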

From the general theory of lattice dynamical systems (e.g. [W]) it is known that
shift invariant probability measures on a space of configurations, $\mathcal{M}$,
satisfy the maximum principle

$$h_{top} = \sup_{\mu \in \mathcal{M}} h_\mu$$

where $h_\mu$ is the measure-entropy. The special measures yielding the equality are
measures of maximal entropy. For two and higher dimensional systems they are in
general not unique. In all our subsequent cases they are believed to be so, but we do
not actually need this knowledge.

Note that if all the limits exist, one could write

$$h_{top} = \frac{1}{p} \sum_{i=1}^{p} h^{(i)}_{top} \tag{1.1}$$

where $h^{(i)}_{top}$ is the topological entropy of the configurations on the $i$-th
sublattice when the original lattice partitions into $p$ sublattices. These entropies
cannot in general be computed independently but rather depend on each other heavily,
as in the case of hard core. It is nevertheless possible to imitate (1.1) by
introducing a sequential approximation of the measure of maximal entropy $\mu$.



2. Lower bounds 

We now proceed to establish lower bounds for the topological entropy using the
sublattice partition representation (1.1) and a sequential fill-in scheme to overcome
the dependencies. To keep the ideas clear we first treat the case of the hard core
rule splitting the lattice into two sublattices and only after that generalize.

Let $N_e$ denote an all-0 nearest neighbor neighborhood of a site on the odd lattice
in the even lattice. In the case of the $\mathbb{Z}^2$ lattice the sites in $N_e$
form the vertices of an even unit diamond, $\diamond_e$ ($\diamond_o$). On the
honeycomb and triangular lattices these sites form triangular arrangements,
$\triangle$ or $\triangledown$, or a hexagon.

It will become quite useful to think of the fill-in in terms of forming a tiling. The
pieces are 0/1-tiles which in the $\mathbb{Z}^2$ case are 0/1-diamonds (as above),
depending on whether the center site carries 0 or 1. On the hexagonal and triangular
lattices the tiles are 0/1-(unit) hexes. Once a sublattice is chosen, one can tile
the plane using any combination of 0/1-tiles centered on the sublattice vertices.

Recall that the Bernoulli measure with parameter $p$, $B(p)$, assigns 1's
independently with probability $p$ to each (sub)lattice site and 0's otherwise. Its
entropy, denoted by $h_B(p)$, is $-p \ln p - (1-p)\ln(1-p)$.

Proposition 2.1.: The topological entropy of the hard core model on a lattice with
a two-way sublattice split is given by

$$h_{top} = \frac{1}{2}\left( h^{(e)}_{top} + P(N_e)\,\ln 2 \right) \tag{2.1}$$

where $h^{(e)}_{top}$ is the entropy of the measure of maximal entropy computed from
the even sublattice alone.

Proof: The representation (2.1) follows from (1.1) by observing that the maximum
entropy is obtained by first assigning the marginal of the measure of maximal entropy
to the even lattice and then filling in the non-blocked sites on the odd lattice.
These are centered at the even unit diamonds. The non-blocked sites must be filled
with $B(1/2)$ to obtain the maximal entropy on the odd lattice, hence the factor
$h_B(1/2) = \ln 2$. ∎

The principle in the Proposition can be directly applied to square and honeycomb
lattices. A further argument is required to cover all regular lattices. In the
following result we present these arguments and further extend to the Kagome lattice,
K (a tripartite graph), as well as to the square lattice with Moore neighborhood,
$\mathbb{Z}^2$M (eight nearest neighbors, a 4-partite graph).

Theorem 2.2.: The topological entropy of the hard core model is bounded from below
on the square ($m = 4$) and honeycomb ($m = 3$) lattices by

$$h_{\mathbb{Z}^2/H}(p) = \frac{1}{2}\left\{ h_B(p) + (1-p)^m \ln 2 \right\}, \tag{2.2}$$

on triangular ($m' = 3$) and Kagome ($m' = 2$) lattices by

$$h_{T/K}(p,q) = \frac{1}{3}\left\{ h_B(p) + (1-p)^{m'} \left[ h_B(q) + [1-(1-p)q]^{m'} \ln 2 \right] \right\} \tag{2.3}$$

and on the $\mathbb{Z}^2$M lattice by

$$\begin{aligned}
h_{\mathbb{Z}^2 M}(p,q,r) = \frac{1}{4}\Big\{ h_B(p) + (1-p)^2 \Big[ & h_B(q) + [1-(1-p)q]^4\, h_B(r) \\
& + (1-p)^2 (1-q)^2 \left[ 1 - (1-(1-p)q)^2 r \right]^2 \ln 2 \Big] \Big\},
\end{aligned} \tag{2.4}$$

where $p$, $q$ and $r \in (0,1)$.



Proof: The lower bounds (2.2) follow simply from (2.1) of Proposition 2.1. by
assigning $B(p)$ to the even sublattice since then $P(N_e) = (1-p)^{|N_e|}$ where the
exponent is the number of elements in $N_e$ in $\mathbb{Z}^2$ and H respectively.

On the triangular lattice the sublattice split is three way. We call the parts the
dot, circle and triangle sublattices. They are filled in three stages in the order
$\circ \to \bullet \to \triangleright$. See Figure 1a and b for the notation and
arrangement of the sublattices in a neighborhood of a triangle site.

Suppose the three sublattices are initially all empty. First fill in the circle
lattice with $B(p)$, hence the entropy contribution $\frac{1}{3} h_B(p)$. Then fill
in all dot sites centered at $\triangledown$ with $B(q)$; this implies the entropy
increase $\frac{1}{3}(1-p)^3 h_B(q)$ from the dot lattice.

To update the center site which is a triangle we need to know that its value is not
forced. Hence

$$\begin{aligned}
& P(\text{center triangle not forced by nearest neighbor circle or dot}) \\
&\quad = P(\text{no 1's in the hexagon of nearest neighbors of the triangle}) \\
&\quad = P(\triangle = 0 \text{ and } \triangledown = 0) = P(c_2 = c_4 = c_5 = 0 \text{ and } d_1 = d_2 = d_3 = 0) \\
&\quad = P(d_1 = d_2 = d_3 = 0 \mid c_2 = c_4 = c_5 = 0)\, P(c_2 = c_4 = c_5 = 0) \\
&\quad = P(d_1 = d_2 = d_3 = 0 \mid c_2 = c_4 = c_5 = 0)\, (1-p)^3 \\
&\quad = \left[ P(d_1 = 0 \mid c_2 = c_4 = c_5 = 0) \right]^3 (1-p)^3 \\
&\quad = \left[ P(c_1 = 1 \text{ or } \{c_1 = 0 \text{ and } d_1 = 0\} \mid c_2 = c_4 = c_5 = 0) \right]^3 (1-p)^3 \\
&\quad = \left[ p + (1-p)(1-q) \right]^3 (1-p)^3
\end{aligned} \tag{2.5}$$

which together with the choice $B(1/2)$ on the non-blocked triangle sites gives (2.3).

The Kagome lattice argument is similar to the triangular one. There are three
sublattices involved, all identical copies of the Kagome, only thinned and reoriented.
For the nearest neighbors of a triangle-site see Figure 1c. Again we fill in the order
$\circ \to \bullet \to \triangleright$. In the last stage the probability of the
triangle site being unforced is now

$$\begin{aligned}
P(c_2 = c_4 = 0 \text{ and } d_1 = d_2 = 0)
&= \left[ P(c_1 = 1 \text{ or } \{d_1 = 0 \text{ and } c_1 = 0\} \mid c_2 = c_4 = 0) \right]^2 (1-p)^2 \\
&= \left[ p + (1-p)(1-q) \right]^2 (1-p)^2 .
\end{aligned}$$

In the case of the square lattice with Moore neighborhood there is a four-way
sublattice split. We denote and fill them in the following way:
$\circ \to \bullet \to \triangleright \to \diamond$ (see Fig. 1d).

The two first terms of the formula (2.4) are straightforward since circles are laid
independently and each dot has exactly two circle neighbors. Furthermore as above we
can show that $P(\triangleright \text{ unforced}) = (1-p)^2 [p + (1-p)(1-q)]^4$.

For the diamond site at the center of Fig. 1d to contribute to the entropy we need
to know the probability that it is unforced, i.e. that all entries in the punctured
square $S$ rendered with dotted line in Fig. 1d are 0's:



$$\begin{aligned}
P(S = 0)
&= P(\text{all } \triangleright, \bullet \in S \text{ are 0's} \mid \text{all } \circ \in S \text{ are 0's})\, (1-p)^4 \\
&= P(\triangleright \in S \text{ are 0's} \mid \circ, \bullet \in S \text{ are 0's})\, (1-p)^4 (1-q)^2 \\
&= \left[ P(d_1 = 1 \text{ or } d_2 = 1 \text{ or } \{d_1 = d_2 = 0 \text{ and } t_1 = 0\} \mid \circ, \bullet \in S \text{ are 0's}) \right]^2 (1-p)^4 (1-q)^2 \\
&= \left[ P(d_1 = 1 \text{ or } d_2 = 1 \mid \circ, \bullet \in S \text{ are 0's}) \right. \\
&\qquad \left. +\, P(d_1 = d_2 = 0 \text{ and } t_1 = 0 \mid \circ, \bullet \in S \text{ are 0's}) \right]^2 (1-p)^4 (1-q)^2
\end{aligned} \tag{2.6}$$

One can compute the two probabilities in the last expression to be

$$2p(1-p)q + (1-p)^2 (2-q)q$$

and

$$\left[ p^2 + 2p(1-p)(1-q) + (1-p)^2 (1-q)^2 \right] (1-r)$$

respectively. From these the formula in the square brackets in (2.6) can finally be
simplified to the form $1 - [1-(1-p)q]^2 r$. ∎

Figures 1a, b, c and d. Tiling of the triangular lattice with hexagonal 1-tiles and
neighborhoods in triangular, Kagome and $\mathbb{Z}^2$M cases.



The entropy bounds in (2.2) - (2.4) can be maximized with respect to the parameters
using standard optimization routines on a desktop machine.
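As an illustration of such a maximization, here is a minimal sketch that evaluates the bounds on a parameter grid. It assumes the readings $\frac12\{h_B(p) + (1-p)^m \ln 2\}$ for the two-sublattice bound and $\frac13\{h_B(p) + (1-p)^{m'}[h_B(q) + (1-(1-p)q)^{m'} \ln 2]\}$ for the three-sublattice bound; the brute-force grid search and all function names are choices made for this example and are not the routines used for the tables.

```python
import math

LN2 = math.log(2.0)

def h_b(p):
    """Entropy of the Bernoulli measure B(p)."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -p * math.log(p) - (1 - p) * math.log(1 - p)

def bound_two_way(p, m):
    """Two-sublattice bound; m = |N_e| (4 for the square, 3 for the honeycomb lattice)."""
    return 0.5 * (h_b(p) + (1 - p) ** m * LN2)

def bound_three_way(p, q, m):
    """Three-sublattice bound; m = m' (3 for the triangular, 2 for the Kagome lattice)."""
    free_dot = (1 - p) ** m            # probability that a dot site is unforced
    free_tri = (1 - (1 - p) * q) ** m  # extra factor for an unforced triangle site
    return (h_b(p) + free_dot * (h_b(q) + free_tri * LN2)) / 3.0

def grid(steps):
    """Midpoints of a uniform partition of (0, 1)."""
    return [(i + 0.5) / steps for i in range(steps)]

def maximize_two_way(m, steps=20000):
    """Return (best bound, p) over the grid."""
    return max((bound_two_way(p, m), p) for p in grid(steps))

def maximize_three_way(m, steps=400):
    """Return (best bound, p, q) over the grid."""
    return max((bound_three_way(p, q, m), p, q)
               for p in grid(steps) for q in grid(steps))

best_z2 = maximize_two_way(4)
best_h = maximize_two_way(3)
best_t = maximize_three_way(3)
best_k = maximize_three_way(2)

for name, best in [("Z^2", best_z2), ("H", best_h), ("T", best_t), ("K", best_k)]:
    print(name, [round(v, 4) for v in best])
```

The first sublattice density is the maximizing p itself; the later ones follow by multiplying the fill-in probability by the probability of being unforced, for example $\frac12 (1-p)^m$ for the odd sublattice in the two-way case.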



L        max h_L   sublattice densities            best estimates
Z^2      0.3924    (0.1702, 0.2370)                0.4075 (0.2266) [MSS], [B2]
H        0.4279    (0.2202, 0.2371)                0.4360 (0.2424) [B2]
T        0.3253    (0.1457, 0.1559, 0.1517)        0.3332 (0.1624) [B2]
K        0.3826    (0.1944, 0.1948, 0.1866)
Z^2 M    0.2858    (0.119, 0.127, 0.130, 0.126)

Table 1. First lower bounds for Hard Core topological entropy and the corresponding
sublattice densities for some 2-d lattices. To the right we have indicated the best
numerical estimates for the entropy and corresponding density (in parentheses) found
in the literature.

We note that while the topological entropy has been computed in the square lattice 
case to a great accuracy (e.g. in [B2] to some 40 decimal places) the corner transfer 
matrix methods used in these numerical studies attack the problem in a very different 
way. Our aim is not to compete in decimal count but rather present an alternative 
method applicable in many lattice set-ups to estimate the entropy which simultaneously 
yields some explicit information on the generic configurations/the measure of maximal 
entropy. 

The measure of maximal entropy doesn't need to be unique for a 2-d lattice model but
in the case of hard square gas it is. This follows from the Dobrushin criterion ([DS],
[RS]). Using this knowledge and the results above we now establish bounds for the
density of 1's in the generic configurations. The exact value of the upper bound in
the following result is in the Proof but we prefer to give the statement in this more
explicit form.

Proposition 2.3.: In the square lattice case the density of 1's at the equilibrium is
in the interval (0.21367, 0.25806).

Proof: Let $\rho_e$ be the density of 1's on the even lattice and let $c$ denote the
expected number of 0's that a 1 forces on the odd lattice. Since exactly half of the
non-forced sites will be 1's it must by the uniqueness of the measure of maximal
entropy hold that $(2+c)\rho_e = 1$. Hence under it

$$P(x_i = 0) = \frac{1+c}{2+c}, \qquad P(x_i = 1) = \frac{1}{2+c}, \qquad P(\diamond_e) = \frac{2}{2+c}$$

on both lattices. $\diamond_e$ is the 0-diamond as defined in the beginning of the
section.

The entropy of any distribution on the even lattice with 1-density $\rho_e$ is
bounded from above by the entropy of the Bernoulli distribution with parameter
$\rho_e$. Hence the total entropy at that 1-density is bounded from above by

$$\frac{1}{2}\left( h_B\!\left(\frac{1}{2+c}\right) + \frac{2}{2+c} \ln 2 \right).$$

Since this expression must be at least the value of $h_{top}$ quoted in Table 1 ([B2]
or [MSS]), we obtain an upper bound for $c$, 2.6801, which in turn gives the lower
bound for $\rho_e$.

The upper bound for $\rho_e$ follows from a lower bound for $c$ which we establish
using a monotonicity argument. The 1's on say the even lattice are
$B(1/2)$-distributed on the non-forced sites. Call this set $F$ and pick a site on it
which has a 1. How many sites will this entry block? Let $F'$ be a superset of $F$.
Then clearly $E(c \mid F) \ge E(c \mid F')$ as in a bigger domain the 1 is more
likely to share the blocking with a nearest neighbor 1 on the same sublattice. Hence
a lower bound is obtained by calculating the blocking for a 1 with its eight nearest
neighbors also in $F$. Enumerating the $2^8$ possible neighborhood configurations and
weighting them uniformly according to the $B(1/2)$-distribution we get the lower
bound for $c$: 15/8. This in turn implies the upper bound for $\rho_e$, 8/31. ∎
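The enumeration over the $2^8$ neighborhoods can be reproduced in a few lines. The sketch below rests on one interpretive assumption that should be stated clearly: a forced odd site adjacent to several 1's is credited in equal shares to each of them, which is how "sharing the blocking" is read here. The coordinates, with the chosen 1 at the origin of $\mathbb{Z}^2$, and all names are illustrative.

```python
from fractions import Fraction
from itertools import product

# the four odd nearest neighbors of the even site at the origin
STEPS = [(1, 0), (-1, 0), (0, 1), (0, -1)]

def even_neighbors(odd_site):
    """The four even sites adjacent to a given odd site."""
    x, y = odd_site
    return [(x + dx, y + dy) for dx, dy in STEPS]

# even sites other than the origin that share an odd neighbor with the origin
SHARERS = sorted({s for o in STEPS for s in even_neighbors(o)} - {(0, 0)})

def expected_blocking():
    """Expected number of odd 0's credited to the 1 at the origin.

    All sharers are non-forced and carry a 1 with probability 1/2, so the
    2^8 configurations are weighted uniformly. An odd site blocked by
    several 1's is split equally among them (an assumption of this sketch).
    """
    total = Fraction(0)
    for bits in product((0, 1), repeat=len(SHARERS)):
        occupied = {s for s, b in zip(SHARERS, bits) if b} | {(0, 0)}
        for odd in STEPS:
            owners = sum(1 for s in even_neighbors(odd) if s in occupied)
            total += Fraction(1, owners)
    return total / 2 ** len(SHARERS)

c_lower = expected_blocking()
density_upper = 1 / (2 + c_lower)  # from (2 + c) * density = 1
print(c_lower, density_upper)
```

Exact rational arithmetic is used so that the result can be compared directly with the fractions 15/8 and 8/31 quoted in the proof.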

Since our first estimate for the lower bound on $\mathbb{Z}^2$ is associated with
densities incompatible with Proposition 2.3. we will try out a symmetric variant of
the theme. The (near) equality of the densities on the sublattices should be a
natural property of a measure corresponding to a good lower bound since the measure
of maximal entropy is believed to be unique in all our cases. In the last three cases
in Table 1 the non-equality of the densities isn't far off but for the first two we
present an "equalization".

Proposition 2.4.: To achieve equal densities of 1's on each of the sublattices one
needs to replace the $B(1/2)$ distribution in the last stage of the measure
construction by $B(p')$ and thereby $\ln 2$ in (2.2) by $h_B(p')$, where
$p' = p(1-p)^{-|N_e|}$.

Proof: In the case of two sublattices after $B(p)$ distribution of 1's on the even
lattice there is a density of $(1-p)^{|N_e|}$ unforcing neighborhoods on this
sublattice. These have to produce the correct density of 1's on the odd lattice, hence
we need the even lattice flip probability $p'$ to satisfy $p'(1-p)^{|N_e|} = p$. ∎

Using the Proposition one can optimize the square lattice topological entropy bound to 
(a slightly worse value) 0.3921 at joint density level 0.2015. In view of Proposition 2.3. 
this indicates that the entropy generating l's are not yet packed in densely enough. In 
the case of the honeycomb lattice the corresponding values are 0.427875 at 0.2284. 
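The equalized bound is a one-parameter problem and can be scanned directly. The following sketch assumes the bound reads $\frac12\{h_B(p) + (1-p)^{m} h_B(p')\}$ with $p' = p(1-p)^{-m}$ and $m = |N_e|$; the grid search and the function names are illustrative only.

```python
import math

def h_b(p):
    """Entropy of the Bernoulli measure B(p)."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -p * math.log(p) - (1 - p) * math.log(1 - p)

def equalized_bound(p, m):
    """Two-sublattice bound with ln 2 replaced by h_B(p'), p' = p (1-p)^(-m)."""
    free = (1 - p) ** m   # density of unforced odd sites
    p_prime = p / free    # fill-in probability giving odd density p
    if p_prime >= 1.0:    # equal densities are not attainable for this p
        return float("-inf")
    return 0.5 * (h_b(p) + free * h_b(p_prime))

def maximize(m, steps=20000):
    """Return (best bound, common density p) over a uniform grid."""
    candidates = [(i + 0.5) / steps for i in range(steps)]
    return max((equalized_bound(p, m), p) for p in candidates)

best_square = maximize(4)
best_honeycomb = maximize(3)
print(best_square, best_honeycomb)
```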

3. Higher order blocks 

To improve the entropy bounds and more importantly to get some insight into the
character of the measure of maximal entropy we now consider more complicated
optimization schemes involving Bernoulli-distributed blocks on sublattices. We first
illustrate the ideas on hexagonal and triangular lattices.

A three-hex is obtained by gluing together three unit hexes so that each has two
joint sides. Figure 2a illustrates three such three-hexes next to each other (for
reference lattice edges are indicated as thin dotted lines in one of the unit hexes).
Note that the unit tiles on each of them are all centered on the same sublattice, the
circle lattice in this case (call the tile a circle three-hex). The dots of the other
sublattice are all in the centers of the three-hexes or on their extremities (three
of them are indicated). Three-hexes of the same orientation obviously tile the plane.




Figures 2a, b and c. 3-hex arrangements in hexagonal and triangular cases. 

Let $B(p)$, $p = (p_0, p_1, p_2, p_3)$, be the Bernoulli distribution on circle
three-hexes with the probability that the three-hex has exactly $k$ 1-tiles in it in
a given orientation being $p_k$ (so $p_0 + 3p_1 + 3p_2 + p_3 = 1$). Its entropy is
then $h^{(3)}_B(p) = -p_0 \ln p_0 - 3p_1 \ln p_1 - 3p_2 \ln p_2 - p_3 \ln p_3$.

Theorem 3.1.: Let $a(p) = p_0 + 2p_1 + p_2$. For the hexagonal lattice the Hard Core
entropy is bounded from below by

$$h^{(3)}_{H}(p) = \frac{1}{6}\left\{ h^{(3)}_B(p) + \left[ p_0 + 2a(p)^3 \right] \ln 2 \right\} \tag{3.1}$$

and for the triangular lattice a corresponding bound is

$$\begin{aligned}
h^{(3)}_{T}(p,q) = \frac{1}{9}\Big\{ & h^{(3)}_B(p) + \left[ p_0 + 2a(p)^3 \right] h_B(q) \\
& + 3\left[ p_1 + p_0(1-q) \right] a(p) \left[ 1 - a(p)q \right]^2 \ln 2 \Big\},
\end{aligned} \tag{3.2}$$

where $p_i, q \in (0,1)$.

Proof: For the construction of the measure we will fill in the lattice in the order
$\circ \to \bullet$. If the circle three-hexes are distributed Bernoulli with
parameter $p$ the entropy contribution from the circle lattice will be
$\frac{1}{2}\cdot\frac{1}{3} h^{(3)}_B(p)$ where the factors result from the
sublattice density and the fact that we distribute triples. As in Proposition 2.1. in
the next stage the maximal entropy choice for the unforced sites on the dot lattice is
the $B(1/2)$ distribution. The total density of sites available is computed at two
different types of dot sites (as in Fig. 2a, the three dots indicated) and is
$\frac{1}{3}\left[ p_0 + 2(p_0 + 2p_1 + p_2)^3 \right]$ where the coefficient 2 and
the power 3 follow from the fact that at two of the three dot sites three adjacent
three-hexes coincide. These formulas combined and simplified yield (3.1).

On the triangular lattice a third sublattice enters and the fill-in order is then
$\circ \to \bullet \to \triangleright$. The entropy contribution from the Bernoulli
circle three-hexes is now $\frac{1}{3}\cdot\frac{1}{3} h^{(3)}_B(p)$ since each
sublattice is identical, hence of density 1/3.

In the second stage the unforced dot sites are filled with the $B(q)$ distribution.
Their density is computed as above to be
$\frac{1}{3}\left[ p_0 + 2(p_0 + 2p_1 + p_2)^3 \right]$, hence the entropy
contribution from the dot lattice will be this expression multiplied by
$\frac{1}{3} h_B(q)$.

In the final stage the unforced triangle sites are filled by $B(1/2)$. Their density
in the full lattice is

$$\begin{aligned}
& \frac{1}{3} P(\text{nearest neighbor } \circ \text{ and } \bullet \text{ sites all 0's}) \\
&\quad = \frac{1}{3}\Big\{ p_1 (p_0 + 2p_1 + p_2) \left[ (p_1 + 2p_2 + p_3) + (p_0 + 2p_1 + p_2)(1-q) \right]^2 \\
&\qquad\quad + p_0 (p_0 + 2p_1 + p_2)(1-q) \left[ (p_1 + 2p_2 + p_3) + (p_0 + 2p_1 + p_2)(1-q) \right]^2 \Big\},
\end{aligned} \tag{3.3}$$

which results from considering the two different arrangements of four neighboring
three-hexes as shown in Fig. 2c (top and bottom cases for the top and bottom
expressions in (3.3)). Since $(p_1 + 2p_2 + p_3) + (p_0 + 2p_1 + p_2) = 1$, the
square brackets equal $1 - a(p)q$. The formulas merged and simplified result in
(3.2). ∎

L    max h_L   (p_0, p_1, p_2, p_3), q                 sublattice densities
H    0.4304    (0.504, 0.110, 0.048, 0.021)            (0.2276, 0.2376)
T    0.3265    (0.64, 0.092, 0.025, 0.010), 0.25       (0.153, 0.155, 0.151)

Table 2. Optimized lower bounds and densities for three-hex Bernoulli blocks.



Remarks: 1. The Kagome lattice case can be done in an identical fashion.
2. Note that apart from improvements in the entropy bounds, almost all of the
sublattice densities have increased (in comparison to values in Table 1) indicating a
better packing of the 1's on the sublattices. Moreover they have significantly less
variation which is to be expected since the densities are equal for the measure of
maximal entropy.

Let us now return to our original motivation, the Hard Core on the square lattice.
Compounding the principles above and some further ideas we will implement an
increasing sequence of lower bounds converging to the topological entropy. Along the
way we'll get more explicit information on the configurations favored by the measure
of maximal entropy.

1-tiles in the $\mathbb{Z}^2$ case are diamonds of side length $\sqrt{2}$ centered on
either of the two sublattices. A $k$-omino is formed by gluing together $k$ such
1-tiles along edges. If $k = n^2$ and the 1-tiles are in a diamond formation we call
it an $n \times n$-block. There are $2^{n^2}$ of them. The optimization results in
Section 2 were for the 1×1-blocks.

Consider next 2×2-blocks. There are 16 of them, but after assuming isotropy for them,
i.e. that blocks that are rotations of each other are distributed with equal
probability (inevitable when the measure of maximal entropy is unique), there are
only five free parameters for the Bernoulli distribution $B(p)$ on them:
$p = (p_0, p_1, p_{21}, p_{22}, p_3, p_4)$ with
$p_0 + 4p_1 + 4p_{21} + 2p_{22} + 4p_3 + p_4 = 1$. Here the first subindex of $p$
refers to the number of 1's in the block and $p_{21}$ and $p_{22}$ denote the two
different arrangements of two 1's in the block (side by side and across,
respectively).

The entropy contribution from the even lattice (on which we first distribute the 1's
using $B(p)$) is now

$$-\frac{1}{8}\left\{ p_0 \ln p_0 + 4p_1 \ln p_1 + 4p_{21} \ln p_{21} + 2p_{22} \ln p_{22} + 4p_3 \ln p_3 + p_4 \ln p_4 \right\}. \tag{3.4}$$

The density of the unforced sites on the odd lattice can be computed from the three
cases indicated in Figure 3a and results in

$$\frac{1}{4}\left\{ p_0 + 2(p_0 + 2p_1 + p_{21})^2 + (p_0 + 3p_1 + 2p_{21} + p_{22} + p_3)^4 \right\}. \tag{3.5}$$

These combined yield a lower bound for $h_{top}$, which is optimized in Table 3
(second row).
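A small sketch of how the two ingredients combine may be useful. It assumes that the bound is the even-lattice entropy (3.4) plus one half (the odd sublattice density) times the unforced density (3.5) times ln 2. The dictionary keys and function names are hypothetical. As a consistency check, when the four tiles of a block are independent $B(p)$ the expression must collapse to the 1×1 bound $\frac12\{h_B(p) + (1-p)^4 \ln 2\}$.

```python
import math

LN2 = math.log(2.0)

# number of 2x2 blocks sharing each probability under the isotropy assumption
MULT = {"p0": 1, "p1": 4, "p21": 4, "p22": 2, "p3": 4, "p4": 1}

def block_bound(p):
    """Lower bound assembled from (3.4) and (3.5); p maps the keys of MULT to probabilities."""
    total = sum(MULT[k] * p[k] for k in MULT)
    assert abs(total - 1.0) < 1e-9, "block probabilities must sum to one"
    # (3.4): entropy per lattice site carried by the even sublattice
    even_entropy = -sum(MULT[k] * p[k] * math.log(p[k]) for k in MULT if p[k] > 0) / 8.0
    # (3.5): fraction of odd sites left unforced (block center, edges, corner)
    edge = p["p0"] + 2 * p["p1"] + p["p21"]
    corner = p["p0"] + 3 * p["p1"] + 2 * p["p21"] + p["p22"] + p["p3"]
    unforced = (p["p0"] + 2 * edge ** 2 + corner ** 4) / 4.0
    # the odd sublattice has density 1/2 and its unforced sites carry B(1/2)
    return even_entropy + 0.5 * unforced * LN2

def product_block(p):
    """2x2 block probabilities when the four tiles are independent B(p)."""
    q = 1 - p
    return {"p0": q ** 4, "p1": p * q ** 3, "p21": p * p * q * q,
            "p22": p * p * q * q, "p3": p ** 3 * q, "p4": p ** 4}

value = block_bound(product_block(0.1702))
print(round(value, 6))
```

Starting an optimizer from such a product block guarantees that the 2×2 search begins at the 1×1 optimum and can only improve on it.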




Figures 3a (3), b, c and d. $n \times n$-blocks and update window in the
$\mathbb{Z}^2$ case. Reductions in a 3×3-block and the extension.



block size   max h_{Z^2}   sublattice densities   red/init variables
1×1          0.392421      (0.1702, 0.2370)       1
2×2          0.39877       (0.1993, 0.2254)       5/15
3×3          0.4014        (0.2073, 0.2254)       46/511

Table 3. Optimized lower bounds and densities for a few Bernoulli blocks on
$\mathbb{Z}^2$.

In the block size 3×3 there are initially 511 free block probabilities to optimize.
When rotational invariance is imposed the number of variables is reduced, and
additionally we expect blocks that are reflections of each other to have equal
probabilities at the optimum. After these two types of symmetries are accounted for
the number of free variables will be 101.
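The symmetry count can be automated by brute force. The sketch below enumerates all 0/1 blocks, maps each to a canonical representative under the eight rotations and reflections of the square, and counts the classes; subtracting one for the normalization constraint gives the number of free variables. The function names are illustrative, and the weak-site reduction discussed next is not included.

```python
from itertools import product

def rotate(block):
    """Rotate a square 0/1 block (tuple of row tuples) by 90 degrees."""
    return tuple(zip(*block[::-1]))

def reflect(block):
    """Mirror a block left to right."""
    return tuple(row[::-1] for row in block)

def canonical(block):
    """Smallest image of the block under the 8 symmetries of the square."""
    images = []
    for start in (block, reflect(block)):
        current = start
        for _ in range(4):
            images.append(current)
            current = rotate(current)
    return min(images)

def count_classes(n):
    """Number of n x n 0/1 blocks up to rotation and reflection."""
    classes = set()
    for bits in product((0, 1), repeat=n * n):
        block = tuple(tuple(bits[r * n:(r + 1) * n]) for r in range(n))
        classes.add(canonical(block))
    return len(classes)

free_2x2 = count_classes(2) - 1  # one variable is fixed by normalization
free_3x3 = count_classes(3) - 1
print(free_2x2, free_3x3)
```

The exhaustive loop is fine for n up to 4 (65,536 blocks); beyond that a Burnside-type count would be the natural replacement.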

In this size and in larger blocks another feature appears which enables further
variable weeding. Consider the block in Figure 3c. The symbol assignments in sites
$x$, $y$ and $z$ are irrelevant in the sense that the existing 1's in the 3×3-block
already force all the odd sites (to carry 0's) that $x$, $y$ and $z$ might force if
any of them were 1's. Hence there are $2^3$ blocks of equal probability. This
combined with the symmetry assumptions above yields a total of 64 blocks with
identical probabilities at the optimum (this is actually the maximum reduction
achievable in this block size). Combing through the set of all blocks for this feature
results in a reduction by a factor of about 11 to the final set of 46 variables.
Their optimal values have been computed and the results are in Table 3.

Subsequently we call sites like $x$, $y$ and $z$ above weak with respect to the rest
of the given block. Only the corner sites of a block cannot ever be weak.

The procedure of variable reduction is highly useful since the above rotational and
reflection symmetry search as well as the weak site identification can be automated.
Moreover the reduction improves significantly at every stage: for example in the next
block size of 4×4 the initial variable number of 65,536 shrinks 66-fold to 991 final
free variables.

Note also that the optima in block size $n \times n$ can be utilized as indicated in
Figure 3d to initiate the search in the next larger block size. Once e.g. the 3×3
subblock optimum probability is known, the added half frame $(e_1, \dots, e_7)$
should be assigned $B(p)$ entries with $p$ computed from the 3×3 blocks. With tailored
optimization routines one should be able to deal with several thousands of variables
in the larger block sizes. All the optimizations here were done with non-specialized
code using Mathematica.



The optimal block probabilities satisfy a useful monotonicity property that we
establish next. For this let $B_i$, $i = 1, 2$, be $n \times n$-blocks, whose subsets
of 1's we refer to as $B_i^{(1)}$. There is a partial order on the blocks via
$B_i^{(1)}$ using the ordinary set inclusion. Let the optimal probabilities for the
blocks be $p = (p_0, p_1, p_2, p_3, \dots, p_l)$, $l = 2^{n^2}$ (no reductions done
yet and no particular order in the coordinates).

Theorem 3.2.: Given two blocks $B_1$ and $B_2$ with optimal lower bound probabilities
$p_1$ and $p_2$, if $B_1^{(1)} \subset B_2^{(1)}$ then $p_1 \ge p_2$. If
$B_2^{(1)} \setminus B_1^{(1)}$ contains only weak sites with respect to $B_1^{(1)}$
then $p_1 = p_2$, otherwise $p_1 > p_2$.

Proof: The optimal lower bound is given by
$h(p) = \frac{1}{2}\left\{ -\frac{1}{n^2}\sum_i p_i \ln p_i + P(N_e) \ln 2 \right\}$
where $N_e$ is the even 2×2-diamond of all 0's as in Section 2. Let $B_i$ be such that
$B_1^{(1)} \subset B_2^{(1)}$ and let $p_1 = p + \epsilon$, $p_2 = p - \epsilon$,
$0 < |\epsilon| < p$. Denote by $h_\epsilon(p)$ the lower bound with the given $p_1$
and $p_2$. To prove the result we will consider the entropy variation under the
probability change of the two blocks:
$\Delta h_\epsilon(p) = h_\epsilon(p) - h_0(p)$. More explicitly

$$\begin{aligned}
\Delta h_\epsilon(p) = \frac{1}{2}\Big\{ & \frac{1}{n^2}\big[ -(p+\epsilon)\ln(p+\epsilon) - (p-\epsilon)\ln(p-\epsilon) + 2p \ln p \big] \\
& + \big[ P_{1,\epsilon}(N_e) - P_1(N_e) \\
& \quad + P_{2,\epsilon}(N_e) - P_2(N_e) \\
& \quad + P_{4,\epsilon}(N_e) - P_4(N_e) \big] \ln 2 \Big\},
\end{aligned} \tag{3.6}$$

where $P_{k,\epsilon}(N_e)$ and $P_k(N_e)$ are the $N_e$-diamond probabilities
computed from the different arrangements involving $k = 1, 2$ or $4$
$n \times n$-blocks as in Figures 3a and b, for the block probability choices
$p \pm \epsilon$ or $p$ for both.

By $\ln(1+x) \approx x$ the first square bracket behaves for small $\epsilon$ like
$c_1 \epsilon^2$, $c_1 < 0$.
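For reference, the quadratic behavior of the first square bracket can be made explicit with a routine Taylor expansion. The auxiliary function $f(x) = -x \ln x$ is notation introduced only for this remark, and the expansion is carried to second order rather than through the first-order approximation of the logarithm; only the negative sign of the coefficient matters for the argument.

```latex
f(x) = -x \ln x, \qquad f''(x) = -\frac{1}{x},
\\[4pt]
-(p+\epsilon)\ln(p+\epsilon) - (p-\epsilon)\ln(p-\epsilon) + 2p\ln p
  = f(p+\epsilon) + f(p-\epsilon) - 2f(p)
  = f''(p)\,\epsilon^2 + O(\epsilon^4)
  = -\frac{\epsilon^2}{p} + O(\epsilon^4).
```

The odd-order terms cancel by symmetry in $\pm\epsilon$, so the bracket is strictly negative for all small nonzero $\epsilon$.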

If $B_2^{(1)} \setminus B_1^{(1)}$ contains only weak sites with respect to
$B_1^{(1)}$ then the blocks $B_i$ allow exactly the same sites to flip on the odd
lattice, hence each of the three last lines in (3.6) vanishes. The sole contribution
to $\Delta h_\epsilon(p)$ then comes from the first square bracket and since this is
negative for small but nonzero $\epsilon$, it must be that $p_1 = p_2$ at the optimum.

If $B_2^{(1)} \setminus B_1^{(1)}$ contains non-weak sites with respect to
$B_1^{(1)}$ let us first assume that they force $k$ odd interior sites (recall that
the odd sites are the vertices of the grids in Figure 4; there are $(n-1)^2$ such
interior sites in an $n \times n$-block). Let $m$ be the number of non-forced odd
interior sites over block $B_1$. Then

$$\begin{aligned}
P_{1,\epsilon}(N_e) - P_1(N_e)
&= \Big\{ \dots + \frac{(p+\epsilon)m}{(n-1)^2} + \frac{(p-\epsilon)(m-k)}{(n-1)^2} + \dots \Big\}
 - \Big\{ \dots + \frac{pm}{(n-1)^2} + \frac{p(m-k)}{(n-1)^2} + \dots \Big\} \\
&= \frac{k\epsilon}{(n-1)^2},
\end{aligned}$$

where the dots refer to the contributions from the other blocks. All these terms
cancel out, since the other block probabilities are identical.

If non-weak sites only force odd interior sites then by the geometry of the set-up
the two last lines in (3.6) are immediately zero. If $e$ extra odd edge, off-corner
sites are forced, an argument similar to the one above gives the estimate
$\left( c_2 + \frac{e\epsilon}{4(n-1)} \right)^2 - c_2^2$, $c_2 > 0$, for
$P_{2,\epsilon}(N_e) - P_2(N_e)$ so the next to last line in (3.6) has the first
order behavior $c_3 \epsilon$, $c_3 > 0$. Some added bookkeeping yields
$P_{4,\epsilon}(N_e) - P_4(N_e) = (c_4 + l\epsilon/4)^4 - c_4^4 \approx c_5 \epsilon$,
$c_5 > 0$ ($l$ is the number of odd corners forced).

The leading order estimates for the four terms in the square brackets in (3.6)
together yield $c_1 \epsilon^2 + d\epsilon$, $c_1 < 0$, $d \ge 0$. If there are
non-weak sites in $B_2^{(1)} \setminus B_1^{(1)}$ with respect to $B_1^{(1)}$, then
$d > 0$. Hence $p_1 > p_2$ must prevail at the optimum. ∎



Remarks: 1. Intuitively the result says that if neither of two even blocks gives more
subsequent choice on the odd lattice, for maximum entropy one should weight them
equally. Otherwise one should favor the one giving more choice on the odd.
2. One can readily see some chains imposed by the order in Figure 4:
$0 \prec 12 \prec 23 \prec 31$ or $0 \prec 11 \prec 21/22 \prec 33$ etc. The
monotonicity can be utilized in limiting the number of $n \times n$-blocks optimized
for larger values of $n$ (dropping blocks with least probability as dictated by the
Theorem and with least multiplicity (most symmetric)).

block         21        22        23        31        32        33
probability   0.01069   0.00337   0.00597   0.00262   0.00479   0.00215

Figure 4. Prevalent 3×3-blocks with optimal probabilities without multiplicities.




The correlation structure inside the measure of maximal entropy gradually presents
itself in the Bernoulli approximations when we consider higher order blocks.
Correlations between the blocks are zero because of independence, but within the
blocks it is worth making comparisons.

By adding the optimum probabilities of all 3×3 blocks at a given density level
$k/9 = 0, 1/9, \dots, 1$ we obtain the "density profile" of this measure (here $k$ is
the number of 1's in the block).

Suppose next that we generate the 3×3 blocks from 1×1 Bernoulli entries with the
appropriate optimal $p$ for 1's (as found above). By adding these up we again obtain
a density profile, this time for the 1×1 optimal Bernoulli measure at the resolution
level of the block size 3×3. The 3×3 blocks can of course be generated using the
optimal 2×2 blocks as well and yet another density profile results. These three
discrete plots are rendered as curves in Figure 5.

Perhaps the most notable feature here is the flattening of the distributions as the
block size increases, i.e. the total block probabilities move towards the tails (while
their means stay constant around 0.22). The curves cross between density levels 1/3
and 4/9: below this crossover the shorter range Bernoulli measures favor light 3×3
blocks, above it they discount heavier blocks in comparison to the optimal 3×3
Bernoulli measure.

When examined closer one will see that the total probability of 3×3 blocks at a given
density level essentially comes from at most three different kinds of local
configurations (up to the reductions above, that is). These seem to be "grown": when
moving from density level $d$ to level $d + 1/9$ the high probability blocks are
generated by adding a (contiguous) 1 into an existing high probability block. This
mechanism cannot prevail when the 3×3 blocks are generated independently from smaller
blocks. Consequently the small block curves in Figure 5 have suppressed tails. We
expect this phenomenon to prevail in the higher order Bernoulli blocks as well and
thereby to be a significant feature in the long range correlations of the measure of
maximal entropy.

Figure 5. 3×3-block occupation probabilities from Bernoulli blocks of size 3×3
(diamond), 2×2 (square) and 1×1 (star). $k \in \{0, 1, \dots, 9\}$ is the number of
1's in the block.